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O Abstract 

(N 

We consider spherical data Xi noised by a random rotation Si € SO(3) so that only the 

sample Zj = £j.?Q, i = 1, . . . ,N is observed. We define a nonparametric test procedure to 

distinguish Ho ■ "the density / of Xi is the uniform density /q on the sphere" and Hi : 

"11/ — /oil 2 ^ C^Pn an d / is in a Sobolev space with smoothness s". For a noise density f E 

with smoothness index v, we show that an adaptive procedure (i.e. s in not assumed to be 

known) cannot have a faster rate of separation than ippf(s) — (N/^\og\og(N))^ 2s ^ 2s+2 " +1 " > 

[— h and we provide a procedure which reaches this rate. We also deal with the case of super 

smooth noise. We illustrate the theory by implementing our test procedure for various kinds 

_J of noise on SO(3) and show that it yields promising numerical results. 

+-> 

Keywords : Adaptive testing, spherical deconvolution, minimax hypothesis testing, nonpara- 
metric alternatives, spherical harmonics. 
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> 

00 1 Introduction 

O 

fvj We consider the spherical convolution model. We observe: 

Zi = £iXi, i = l,...,N (1) 

(N 

t— ( where the £j are i.i.d. random variables of SO(3) the rotation group in M 3 and the X_ s are i.i.d. 

random variables of S 2 , the unit sphere of M 3 . We suppose that Xi and £{ are independent. 
We also assume that the distributions of Zi and Xi are absolutely continuous with respect to 
the uniform measure on § 2 and we set fz and / the densities of Z{ and Xi respectively. The 
distribution of £j is absolutely continuous with respect to the Haar measure on SO(3) and we 
will denote it f e . Then we have 

fz = fe* /) 

where * denotes the convolution product which is defined below in (pi). 

Roughly speaking, the density spherical convolution model provides a setup where each gen- 
uine observation Xi is contaminated by a small random rotation. The aim of the present paper 
is to provide nonparametric adaptive minimax goodness-of-fit testing procedures on / from the 
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noisy observations Zj. More precisely, let /o being the uniform density on §> 2 , we consider the 
problem of testing the null hypothesis f = fo with alternatives expressed in L2 norm over Sobolev 
classes. 

To the best of our knowledge, goodness-of-fit testing in spherical convolution model does not 
exist. Our results are strongly motivated by astrophysical interests. In astrophysics, a burning 
issue consists in understanding the behaviour of the so-called Ultra High Energy Cosmic Rays 
(UHECR). These latter are cosmic rays with an extreme kinetic energy (of the order of 10 19 eV) 
and the rarest particles in the universe. The source of those most energetic particles remains 
a mystery and the stake lies in finding out their origins and which process produces them. 
Astrophysicists have at their disposal directional data which are measurements of the incoming 
directions of the UHECR on Earth. 

Needless to say that finding out more about the law of probability of those incoming directions 
is crucial to gain an insight into the mechanisms generating the UHECR. Several hypotheses are 
made. A uniform density would suggest that the High Energy Cosmic Rays are generated by 
cosmological effects, such as the decay of relic particles from the Big Bang. On the contrary, 
if these UHECR are generated by astrophysical phenomena (such as acceleration into Active 
Galactic Nuclei (AGN)), then we should observe a density function which is highly non- uniform 
and tightly correlated with the the local distribution of extragalactic supermassive black holes 
at the center of nearby galaxies (AGN). First results seemed to favour a non-uniform density 



but as underlined by Fay et al. (2011), a more recent analysis based on 69 observations of 



UHECR softens this conclusion of anisotropy. To this prospect, these relevant considerations 
lead naturally to goodness-of-fit testing on the uniform density. 



Fay et al. (2011) recently developed isotropy goodness-of-fit tests based on the so-called 



needlets for the direct case. Their study is focused on the practical aspect with simulations 
connected to realistic cosmic rays scenari. In our work, we consider the indirect case in which 
some noise occurs. Indeed, the observations of the events Xj are always most often perturbated 
by a secondary noise (ej) which leads to the convolution statistical model ([I]). Here, we put 
the emphasis on theoretical statements and completed them with promising simulations. In 
particular, we consider complex noise distributions on SO(3) such as the Gaussian and the 
Laplace ones which have never been implemented so far. 

Considering goodness-of-fit testing in the spherical convolution model not only finds its in- 
terest in the above important applications, but it also fills a gap both in the noisy setup testing 
literature and the spherical convolution one. Indeed, convolution models have been extensively 
studied on the real line and more recently on the sphere. However, so far, only estimation has 
been treated in the spherical setup. For estimation problem, one is interested in recovering the 



underlying density / from noisy observations Zi. The pioneer works of Healy et al. (1998), Kim 



and Koo (2002), Kim et al. (2004) introduced a minimax estimation procedure based on the 



Fourier basis of L2(S^. Recently, Kerkyacharian et al. (2011) proposed an optimal and adaptive 
hard thresholding estimation procedure based on needlets. 

Nonpar ametric goodness-of-fit testing has aroused a lot of interest. For minimax testing, we 



refer to the work of Ingster (1993) who settled down the theoretical basements. Spokoiny (1996) 



first established adaptive testing procedure based on wavelets over Besov bodies. Nonetheless, 
goodness-of-fit testing has mainly focused on the case of direct observations. Indeed, very few 
works have been devoted to the case of indirect observations and if so, only on K. On this point, 



let us cite the pioneer works of Butucea (2007) who built minimax nonparametric goodness-of-fit 



testing for convolution models based on kernels methods, of Butucea et al. (2009) which made a 



step forward by building an adaptive testing procedure in the noisy setup. 

We would also like to bring to the reader's attention some interesting fact when encountering 
testing problems with indirect observations. Indeed, there is a natural pas de deux between the 
following approaches : to test f = fo or to test f £ * f = f e * fo- This question has been the 



object of the recent work of Laurent et al. (|2011|) and has been previously evoked by Butucea 

Dubes and Marteau 
* f = fe * fo versus 



et al. (2009 ). In the case of the convolution model on the real line, Laurent, Loubes and Marteau 
prove that if a test procedure is minimax for testing problem : Hq° : f £ 
■ fe*(f ~ fo) e Fd where 



H? 



•Fd = {g with smoothness s' and ||g|| > C'n s '' s + ', with s' = s + v}, 



then it is minimax for Hq ■ f = fo versus H[ : f — /o G J~i where 

Ti = {/ with smoothness s and ||/|| 2 > Cn' As ^ As+iu+l ^} 

but the reverse is not true (here n is the number of data and v the smoothness index of the 
noise). This interesting conclusion (that we can conjecture true in our context also) does not 
make it any the less necessary to study the inverse problem here. Indeed, until the present work, 
the minimax rates was not known in the context of noisy spherical data. Moreover, when dealing 
with adaptive procedures, the link between the direct and inverse problems is not established 
yet. 

In the present paper, the whole difficulty actually lies in the spherical geometry which com- 
plicates every steps that one encounters on R. Indeed, the efficient test statistic of |Butucea" 
(2007) was built upon a deconvolution kernel estimator of the quadratic functional f (/ — fo) 2 . It 
is well-known that such an estimator is closely linked to the Fourier transform on K. There exist 
kernel methods to treat density estimation for spherical data but only for direct observations 



(see Hall et al. (1987), and Bai et al. (1988)). Here in the spherical convolution context, Fourier 
analysis has a different behaviour and we resort to existing procedures to estimate the quadratic 
risk J(f — fo) 2 . Those procedures (see Kim and Koo (2002)) are based on Fourier series which 



come down to projections. Consequently, the approach proves to be quite different than the one 
on the real line. The difficulty of testing in a spherical deconvolution model can be seen in the 
following way. If you use an orthogonal basis {ipk) to estimate the unknown function /, then 
using U-statistics requires that the "deconvoluted" basis <j>k (s.t. ipk = fe * 4>k) is also (almost) 
orthogonal, which is delicate to realize. Thus one has to circumvent new problems linked to 
estimation of the quadratic functional, Fourier series, spherical context and convolution model 
setting. This explains why we choose to use spherical harmonics and their good properties in 
term of orthogonality. 

The plan of the paper is as follows. In Section 2, we give a brief overview about harmonic 
analysis on SO (3) and S 2 which will be necessary throughout the paper. In Section 3 we define 
the test hypotheses and the smoothness assumptions about the unknown density / and the noise 
£i. We also introduce the adaptive goodness-of-fit testing procedure. In Sections 4 and 5 we 
compute lower and upper bounds for testing rates for the ordinary smooth noise case. The super 
smooth noise case is treated in Section 6. Finally, we give a simulation study in Section 7. The 
proofs of the results are detailed in Section 8. 



2 Some preliminaries about harmonic analysis on SO (3) and § s 



We will provide a brief overview of Fourier analysis on SO (3) and §> 2 . Most of the material can be 
found in expanded form in Vilenkin (1968), Talman ( 1968| ), Terras (1985), Kim and Koo (2002) 
andlHealy et all (119981). 



Let D mn for —l<m, n < I, 1 = 0, 1, ... be the eigenfunctions of the Laplace Beltrami 
operator on SO(3), hence, y/2l + lD mn , —l<m, n <l, 1 = 0, 1, ... is a complete orthonormal 
basis for L2(SO(3)) with respect to the probability Haar measure. Explicit formulae of the 
rotational harmonics D l mn in term of Euler angles exist but we do not need it here. Next, for 
/ G L2(SO(3)), we define the rotational Fourier transform on SO(3) by the (21 + 1) x (21 + 1) 
matrices / with entries 

fmn= I f(g)D l mn (g)dg, 



SO(3) 



where dg is the probability Haar measure on SO (3). The rotational inversion can be obtained by 

/o?)=E E /™( 2I+1 )S ( 2 ) 

l —l<m, n<l 

(pj is to be understood in L2-sense although with additional smoothness conditions, it can hold 
pointwise. 

A parallel spherical Fourier analysis is available on §> 2 . Any point on § 2 can be represented 
by 

u> = (cos(j)sm9,sm(/)sm8, cos#)*, 

with (p € [0, 2tt), £ [0, ir). We also define the functions: 



Y m ( w ) — Y n 



(2l + l)(l-m)l t , 9) jm+ 



for —l<m < I, I = 0, 1, . . ., <fi € [0, 2ir), 9 G [0, 7f) and where P l m are the associated Legendre 
functions. The functions Yj^ obey 



Y l 



(~l) m Yl 



(3) 



Let L2(§ 2 ) denote the space of square integrable functions on § 2 , that is, the set of measurable 
functions / on S 2 for which 



/ |/( 



x)\ dx 



< oo, 



where dx is the Lebesgue measure on the sphere S 2 . It is well-known that L2(S 2 ) is a Hilbert 
space with the inner product 



(f,gh 



f(x)g(x)dx, f,geh 2 (§ 2 



The set {Y^, —l<m < I, 1 = 0, 1, . . .} is forming an orthonormal basis of L2(S 2 ), generally 
referred to as the spherical harmonic basis. Again, as above, for / G L2(S 2 ), we define the 
spherical Fourier transform on § 2 by 



/*< = / f(x)Yj n (x)dx. (4) 

We think of Q as the vector entries of the (2/ + 1) vector 

/* = [fm]-l<m<l, 1 = 0, 1, . . . 

The spherical inversion can be obtained by 

/(«) = £ £ /K»- 



J -Km<l 



The bases detailed above are important because they realize a singular value decomposition of the 
convolution operator created by our model. In effect, we define for f £ £ L2(SO(3)), / G L2(§ 2 ) 
the convolution by the following formula: 

fe * f(u>) = f f £ (u)f{ U - l U:)du (5) 

JSO(3) 

and we have for all — I < m < I, I = 0, 1, . . ., 

I 

\Ie * J)rn = £_, Je,mnJn = \Je J )m- (6) 

n=—l 

We shall recall some basic facts which will be useful throughout the paper. Let HI; the vector 
space spanned by {Y^ = — I < m < 1} for each I = 0, 1, ... . Any element h £ Mi can be 
written as h = X^m=— z ^rnXm an( ^ thanks to Parseval equality we have [|/i||2 = Ylm,=-i\^ l rn\ 2 - 
Now according to ^ we have 

ff : M t -> m t defined by /*</* = £ ( £ (/*')mn>*n J *£• 

m=—l \n=—l / 

We finally get the operator inequality 

||/^|| 2 < ||/ £ i| op |N| 2 , where ||/*<|| op = sup ij^M*. 

3 Model and assumptions 



We would like to present our results in terms of Sobolev classes (see e.g. Healy et al. (1998) for a 
definition on the sphere). On the space C°°(§ 2 ) of infinitely continuous differentiable functions 
on § 2 , consider the so-called Sobolev norm || || w s of order s defined in the following way. For any 
function / = ]T, m j^Y l m let 



2 



Y, Ea+ i(i +i))*i£fi 2 - ( ? ) 



Z>0 m=-l 



We denote by Ws(§ 2 ) the vector space completion of C°°(§ 2 ) with respect to the Sobolev norm 
(M) of order s. For some fixed constant R > 0, let W S (S> 2 ,R) denote the smoothness class of 
functions / G Ws(§ 2 ) which satisfy 

\\f\\w.<R. (8) 

For the uniform density of probability on the sphere namely /o = (47r)~ Iga, we want to test 
the hypothesis 

Ho : / = /0) 

from observations Z\, . . . , Z^ given by model (nj). We consider the alternative 

H^R^n) : / E W S (§ 2 ,R) and ||/ - / ||§ > Cfa 

where C is a constant and ipN is the testing rate. 

We will say that the distribution of e is ordinary smooth of order v if the rotational Fourier 
transform of f £ satisfies the following assumption. 

Assumption 1. For all I > 0, the matrix /* is invertible and there exist positive constants 
do,d\,v such that 

\\f?-A\o P < d l l» and ||/*'|| op < dil~ v , 

where we have denoted the matrix (/* ) _1 by f*L 1 . 

Recall that we assume that f e is known, consequently do and v are also considered known. 
Some examples satisfying this assumption are given in Section [7] 

In order to build a test statistic, as usual, we first have to construct an unbiased estimator of 
the quadratic functional J^if — fo) 2 = ||/ — /oil!- To do so, we remark that thanks to Parseval 
equality: 



/(/-/o) 2 = E E \ti - foi\ 2 = E E i/. 

"^ S> 7-^.n ^, J 7-^.1 , 7 



*Zi2 



l>0 m =-l l>l m =-l 



the last equality coming from the fact that (fo) m 7^ only for (I, m) = (0, 0). Since /*' = f*Li_f z l 
for / = 0, 1, . . . , we can write under Assumption 1 

l 

fm = /-J (/e-OmnC/z )n- 
n=— I 

A natural estimator of /^ is given by 

JV I 
fm = T7 2-^i 2-^i \fe- 1 )™,'nY n (Zi). 



N 

i=l n=—l 



If we denote by $; m (x) = ^ n= _ i (/*l 1 ) m „y'i(3;) then 

JV 
N 



1 * 



i=l 



Consequently, we can derive an unbiased estimator T\ m of |/, 

2 



*i|2 



-£m 



N(N 

' Il<l2 



pr J] ^i m {Z h )^ lm {Z i2 ), 



and finally an estimator of ||/ — /o 111 

i l 2 

^ = E E JV(JV-l) E <$>lm{Z tl )$> lm {Z l2 ). 
1=1 m=—l «i<«2 

We can now define a test procedure 



A= fl if|T L |>t 2 
1 otherwise 

for a threshold t 2 to be suitably chosen. The choice of L is crucial too, ans this point will be 
solved in Sections 5 & 6. 

As one may have noticed, the noise smoothness hypothesis and hence the test procedure only 
rely on the Fourier transform of the noise density f e . Consequently, we do not need the existence 
of the density f e but only its Fourier transform. 

4 Lower bound for testing rate 

It is known that the rate of separation in the case of direct observations in dimension two is 
^y-4s/4s+2^ ^t us see how it is modified by the presence of a noise with smoothness v. 

Theorem 1. Assume that R > 1/^/2tt. Let s > 1 and tp N = N- 2s ^ 2s+2u+1 \ Let 7] G (0, 1). // 
C < KR 2 where K is a constant only depending on do,di,v,s,r} ) then 

hminfinf ip /o (Ajv = l)+ sup P f (A N = 0)\ >r] 

N^oo A N (^ feHtfaRfiM J 

where the infimum is taken over all test procedures A^v based on the observations Z±, . . . , Zjy. 

This means that testing with a faster rate than ijjjy = j\f- 2s /( 2s + 2 ' / + 1 ) [ s impossible. If the 
distance beetween /o and the alternative is smaller than t/jj^ = ]\[- 2s /{ 2s + 2u + 1 ) ; the sum of the 
error of the two kinds is close to 1. Nevertheless, it requires the knowledge of the smoothness 
index s. That is why we want to build on a so-called adaptive test procedure which does not 
depend on s. But we prove in the next statement that we have to face a phenomenon of "lack 
of adaptability" for our problem, i.e. it is not possible to test adaptively with the same rate. 
Indeed, in the context where s is unknown and belongs to some set S, there is not any universal 
test with small error for each s £ S. The price to pay for adaptivity is an extra factor ydog log N 
in the separation rate. 



Theorem 2. Assume that R > 1/^/2tt. For alls > 1, let^(s) = (N/^log \og(N))- 2s ^ 2s+2,/+1 '> . 
Let 7] £ (0,1). If C < KB? where K is a constant only depending on do,di, v, s,rj, then, if S is 
a set containing an interval, 

liminf inf \ P /o (A N = 1) + sup sup Pf(A N = 0) \ > rj 

N-+oc A N (^ seS f eHl (s,R,Ci;% d (s)) J 

where the infimum is taken over all test procedures Ajy based on the observations Z\, . . . , Z^. 
Moreover any rate faster than ip^f will also lead to a lower-bounded error. 

5 Upper bound for testing rate 

In order to construct an adaptive procedure of testing, we shall use the following exponential 
inequality. 

Lemma 1. There exists Kq,K\ such that, for all sequence un, 

F {\T L \ > L 2u+l u N /N) < K ie M-Kou 2 N ) 

provided that uj^L^ 1 , LN~ 2 ift N and LN~ l v? N are bounded. 

Actually the term L 2u+l /N is the order of the variance of Tl under Hq. We denote \x~\ the 
smallest integer larger than or equal to x. 

Theorem 3. Assume s > 1 and ipw = (N/ V^og log N) - 2s /( 2s + 2u + 1 ) . \y e consider the set C = 
{2i°, . . . , 2- ?m } where jo = |~log 2 (log log N)] , j m = |~log 2 (iV(loglog A)~ 3 / 2 )] and the adaptive test 
statistic 

Dn = 1 {m^ LeC {\T L \/tl)>^/2/K~ } 



with t\ = L 2u+1 ^/\og\ogN/N. Let < 77 < 1. Then, if C > \]2K^ 1 + R 2 , 

limsupiP/ (Z>iv = l)+ sup Pf(D N = 0) \ < rj. 

Af^oo [ feH-,{s,R,Ci!> N ) J 

Note that the direct case (without noise) is included in this result, taking e = Id, f* = Id, 
v = 0. In this case, the separation rate is (N/^log log N)~ 2s ' ( 2s + 1 ). To our knowledge, even in 
this simpler case, this result was not established yet. 

6 Super smooth noise 

In this section, we deal with the case of a super smooth noise. This kind of noise is of interest 
since it includes the Gaussian distribution. We will say that the distribution of e is super smooth 
of order v if the rotational Fourier transform of f £ satisfies 

Assumption 2. For all I > 0, the matrix f* is invertible and there exist reals v\ < uq, and 
positive constants do,d\,5,f3 such that 

\\ff-A\op < du l r v * exp(^/<5) and \\ff\\ op < d x r exp(-l p /S). 



In this case, we present a similar test statistic but with a different threshold t^. Moreover it 
is sufficient to consider only one L* instead of a maximum. 



Theorem 4. Let ip N = (logiV)" 2 ^ and K > 0. We consider L* = (5log(N)/8) 1/l3 
test statistic 

D N 



and the 



1 {\T L ,\/tl t >K } 



with t\ = L~ 2uo+1 exp(2L^/6)/N. Let < ry < 1. Then, if C > K + R 2 (5/W) 



-2s IP 



lim sup < P/ (Dn 

7V-s>oo 



sup ¥ f (D N = 0) } < 77. 

f£Hi(s,R,Ci> n ) I 



We observe that in this case the separation rate is very slow ipN = (log N)~ 2s '". However, 
this rate is reached without any knowledge on the smoothness of /. Moreover, we prove that 
this is the optimal rate: 

Theorem 5. Assume that R > l/V^r. Let s > 1/2 and ip N = (logN)- 2s ^. Let 77 E (0, 1). // 
C < KR 2 where K is a constant only depending on do,di,Uo,/3,5,s,rj, then 



lim inf inf 

7V->-oo Ajv 



> /o (A;v = l)+ sup P f (A N = 0) \ > r? 

f£Hi(s,R,Cxl> N (s)) J 



where the infimum is taken over all test procedures A^r based on the observations Z\, . . . , Zjv- 
The deterioration of the rate in the case of a super smooth noise is a well-known phenomenon 



in convolution models (see e.g. Fan (1991)). 



7 Simulations 

In this section we implement our test procedure for two kinds of noise. First we will present the 
noises on SO(3) we consider, then we explain the protocol we are going to follow and eventually 
we give the performances of our goodness-of-fit test in term of power. 

In our theoretical statements, we talked about ordinary and super smooth noises on the group 
SO (3), but what does it mean in practice? Does it exist concrete examples of random matrices 
which could be generated according to densities which meet those smoothness assumptions? The 
answer is yes. We will particularly highlight two cases, the Rotational Laplace and the Gaussian 



densities on SO(3) (for further details see Kim and Koo (2002)). To the best of our knowledge, 
they have never been implemented in practice. The first one is an ordinary smooth density and 
the second one a super smooth one. 

As explained in section 3, the noise smoothness can be characterized by the decay of its 
rotational Fourier transform. The Rotational Laplace distribution is the rotational analogue of 
the well-known Euclidean Laplace distribution (known also as double exponential distribution). 
It has been discussed in depth in Healy et al. ( |1998 ). Its expanded form in term of rotational 
harmonics is the following 



/- = E E( 1 + cj2 ^ + 1 )r 1 ( 2/ + 1 )^ 



0) 



Z>0 m=-l 



for some a 2 > which is a variance parameter. Hence we have 

f*j mn = (l + a 2 l(l + l))- 1 6 m n, 

for I = 0, 1, . . . and where 5 mn = 1 if m = n and is otherwise. The Laplace distribution is 
ordinary smooth with a smoothness index u = 2. 

Let us present now the Gaussian distribution. The distribution can be written as follows (see 
Kim and Koo] ( |2002| )) 



fe = E E exp(-a 2 Z(Z + 1)/2)(2Z + l)D^ m , (10) 

l>0 m=-l 

for a > 0. We have 

/* z mn = exp(-a 2 Z(Z + 1)/2)<W 

This is an example of a super smooth distribution with 5 = 2/a 2 and (3 = 2 following the 
terminology in Section 6. 

Here is our testing procedure steps. 

1. Under Hq generate N observations uniformly. 

2. With these N uniform observations, compute by 1000 Monte Carlo runs the 10% quantile 
of the statistics maxi g £(|Ti|/t^) defined in Theorem 3. We will denote it qio. 

At this stage it is possible to apply the test to any noised data. Here we study the performance 
of the test with the following next steps. 

3. Generate iV random rotation matrices following the Laplace or the Gaussian distribution 
and apply them to N observations generated according to the alternative density g in Hi 
(see Figure 1). 

4. Compute the test power given by P 9 (max/ /g £(|T^|/t| / ) > qio) by 200 Monte Carlo runs. 

We point out that the numerical procedure is notably fast all the more so as we are in di- 
mension 2. 

The alternative density g is represented in spherical coordinates on Figure 1. It has two 
modes, one very sharp and the other one oblate. 

For more clarity about how the noise matrices are involved in the problem, we represent 500 
observations generated under the alternative density g with and without noise, see Figure 2. To 
plot the figure, we consider the Laplace distribution noise with a variance parameter a 2 = 0.5. 
It turned out that applying random rotations tends to spread uniformly the observations which 
makes the separation between the null and the alternative hypothesis a difficult task. 

We would also like to make a remark about how to generate random matrices according to 
the Laplace or the Gaussian distribution. After rewriting carefully their density expressions in 
terms of rotational harmonics given by (|9| and (10), it turned out that f £ (u) only depends on 



the angle of the rotation u, say 6. Then the simulation of a rotation following f £ amounts to 
pick at random an axis and perform a rotation about this axis by an angle following the law 

/ e (0)(l-COB(0))/7T. 

10 
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Figure 1: The alternative density 
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Figure 2: Observations under Hi without and with noise 

For more conciseness, we do not present the empirical level of the test which is whatever 
the noise, the number of observations, between 6% and 12%. Indeed, the procedure is devised 
in such a way that we have a level around 10%. Let us focus now on the test power which is 
more informative. We computed it for several number of observations going from 50 only to 
500 and give our results in Table [T] We point out that we deliberately considered the cases of 
very few observations such as 50 and 100 because these scenari are realistic in astrophysics when 
collecting UHECR observations. Our test procedure performs pretty well with excellent results 
if we have 500 observations. For reasonable values of the variance parameters such as 0.1, the 
procedure yields reasonable results even for 50 or 100 observations only. 
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Table 1: Test Power 





Noise distribution 


iV = 50 


N = 100 


iV = 250 


JV = 500 




Laplace, a 2 = 0.1 
a 2 = 0.25 
a 2 = 0.5 


0.59 
0.37 
0.20 


0.93 
0.68 
0.30 


1 
0.99 
0.63 


1 

1 

0.96 




Gaussian, a 2 = 0.1 
a 2 = 0.25 
a 2 = 0.5 


0.42 
0.30 
0.16 


0.81 
0.35 
0.33 


0.99 
0.77 
0.50 


1 
0.98 
0.83 


8 Proofs 
8.1 Proof ( 


Df Theorem Q] 











As usual in the proofs of lower bounds, we build a set of functions quite far from /q in term of L2 
norm, but whose distance between the resulting models is small. More precisely, let 7 = 7(iV) 
and L = L(N), respectively a scale factor and a resolution level to be specified later. For all 
I < L and m £ {—I, ... ,1}, we define (fi m the function such that 



fe * <Ph 



yL. 



Here Y^ denotes the real form of the spherical harmonic, that we denote as the complex form 
for the sake of simplicity. Using the real form ensures that ipi m is real. The existence of such 
a function is ensured by the assumption of invertibility of matrices f* 1 and we can write tpi m = 
En=-!(/ri)nm^ Now, for 9i m , I < L, m 6 {—I,..., I}, independent random variables with 



distribution 



lm 



±7) = 1/2, we introduce 



L-l l 

l=L/2m=-l 

In the sequel we show that, for good choices of 7 and L, 

• f g belongs to W S (S 2 ,R), 

• fg is a density function, 

• ||/e-/o|| 2 >C^, 

• X 2 (Pe, Pf ) < (1 — v) 2 f° r N large enough. 

Then, if T is the maximum likelihood test which is equal to 1 if fL=l fd(Xi)/ fo(Xi) > 1 and 
otherwise, 

P fo (T = 1) + P e {T = 0) > fmm(dP e ,dP f0 ) > 1 - yj ' X 2 {dPe,dP fo ) > 1 - (1 - 77) (11) 



using section 2.4.1 in Tsybakov (2009). Thus, for N large enough, 

P /o (T = l)+ sup P / (T = 0)>7 ? 

f€Hx(s,R,C^ N ) 
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and the result is proved. 



• Belonging to the Sobolev space: 
We compute 

J2(l + l(l + l)) s \<feX>\ 2 

In 



L-l 

= |< /e ,y o>|2+ £ Y,(l + Kl + m<feX> 

l=L/2 n 
L-l I 

< \ <fo x>\ 2 + £u+^ + i)) s Ei E m/« 

l=L/2 n m=-l 

1 L-l J 



*/ \ 1 2 

1 jnm\ 



l=L/2 



m=—l 



L-l 



«=L/2 



Hence, belonging to the Sobolev ball imposes that 



2 T 2(s+u+l) 



Yl 



< 



R 2 



2C 1 (d ,8,v) 
Then it is sufficient to choose 7 2 = Cl L- 2 ^ +s+1 ^ with c\ < R 2 /(2C 1 (d , s, v)). 

• Density: 
Since, for I > 0, 



'Am 



Vim — {<Plm)o — 



we obtain J fg = 1. Let us now show that fg > 0. We first use the Cauchy-Schwarz inequality 



L-l 



L-l 



mx) - Mx)\ 2 < e Ek/*-/o,^>i 2 E Ei*)P 

Z=L/2 n l=L/2 n 



Next, since spherical harmonics have the property Y2n l^nl — (^ + l)/(4vr 



L-l 



i/ e (x)-/ (x)i 2 < E E 

/=L/2 n 
„ L-l 



E °lm(fs- 



m=—l 



* £ E ntf-ik E \^\ 2L 



L-l 

£ 

Z=L/2 
2 



2/ + 1 

47T 



l=L/2 



m=—l 



9d 2 2 L "' 
< — 5_7 2 



8vr 



E ' 2 " +1 ^ 2 < C 2 7 2 L 2 ^ 



+4 



l=L/2 
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with C| = 9d Q /(87t(2u + 2)). Thus, replacing 7 by its value, 

fe>h~ Wfe - /olloo > i- - C 2l L^ 2 > ±- - C^L 1 -*. 

Air 47r 

Now, since s > 1, /g is a density as soon as 

1 „ ,— v + \ 

- A C 2 Jc[ > 44> c\ < k. 

4tt v " " 97rd 2 



• Separation rate: 

V 

L-l i 



We denote Pe = f £ * f e = f + Ya=lIi T, m e lmXL- Then 



r W^ I 2 



/oil 2 = n/ £ *(/ e -/o)ii 2 = E Ei(£U-/° 

l=L/2m=-l 

L-l I 

< E 11^6 E it/* - /o)-i 2 ^ ^ 2 (V2)- 2 ii/ e - /oi 

Z=L/2 m=-Z 



Moreover 



\\po - /oil 2 = E E i^i 2 = 7 2 E ( 2/ + !) ^ c ^ 2 

Z=L/2 m l=L/2 

Finally ||/„ - / || 2 > C 3 (di, v)~i 2 L 2u+2 = C 3 (di, v)c x L- 2s . Now we choose 



L 



^yl/(2s+2z/+l 



where |_a?J denotes the largest integer which is smaller than or equal to x. Thus ||/e — /o|| 2 > 
6*3(^1, j/)ciV , at > CV'JV as soon as C < Cs(di, v)c\. 

• Chi-square divergence: 

We denote // the measure defined by dfi(6) = Y\i m (5i + <5_i)/2. We want to show that 

where 

,itd r N (7- N 



^-yn»*)-E,n*«<*>- 



First, note that Ej (47T£>6i(Zi)) = 1 + Ylim^Qlm f Yjn = 1- Then, using Fubini and the indepen- 
dence of the Zj, E/ ( 3 p iL ) = E M niLi ^/o(47i"P6»(Zj)) = 1. So it is sufficient to prove that 

E 4(t)> i+(i -" 2 
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Using Fubini, 



(4vT 



^V E /o 



dP„ 



dP 



h 



A 



A 



N 



i=X 



E /o y&^X\pe{Zi)X{p8<{Z t ) \ = E /o iE^HpeiZ^pe'iZi 



where d(fi x fi)((9,9')) = dfi(9)dfi(9'). Now, the Zi being independent, we write 



A' 



A' 



%o \{ve{Z l )ve'{Z l ) = Y[E fo (p (Z i )p 9 '(Z i )) 



\i=l 



i=l 



PePe* 



47T 



A 



(12) 



Using the definition of pq and the orthogonality of the YJl, we obtain 



L-l I 



Jp°p°' = ^+ E E 



lm"lm- 



(13) 



l=L/2m=-l 



Combining (12) and (13), and then inequality 1 + a < e a , gives 

A 



E/o \JlMZdP0>(Z<)\ = f^p + ^E^O ^(Jp^P^^E 1 



ImPlm) 



< 



(4vr 



-2^ JJexp(iY4vr^ m 6' z / r7 



z?« 



so that 



E 



./n 



dP„ 



dP 



/o 



^iE^x^n^p^ 4 ^" 1 ^)- 



/m 



Using the distribution of (9, 9'), we obtain 



E- >> dP » 



'-fa 



dP f . 



-i— r 1 1 -i— r 

- II " ex P( Ar 7 2 4 7r ) + - exp(-A^7 2 4vr) < J^[cosh(A^7 2 47r). 



■> 0/ / lm 

Now, using cosh(2x) = 1 + 2sinh (x) and inequality 1 + a < e a , 

2\ 



/»i 



- • , dP u 

E- ' ( M 



^/o 



dP 



/o 



< 



< 



\\(l + 2sinh 2 (iV7 2 27r)) < J^[exp(2smh 2 (iV7 2 27r)) 



lm 



lm 



exp(2^sinh 2 (iV7 2 27r)). 



lm 



Since 7V7 2 — > and sinh(x) = x + o(x), there exists C4 > such that, for iV large enough, 



2„,4 



sinh^A^vr) < C4W7 



15 



Then 

^sinh 2 (iV7 2 27r) < C 4 iV 2 7 4 ^ 1 < C 5 N 2 -f 4 L 2 . 

lm Im 

That yields, for N large enough, 

E /o ((^) 2 ) < exp(2C 5 iVVi 2 ) < exp(2C 5 c 2 iV 2 L- 4 - 4 - 2 ). 

But remember that N 2 < (L + l) is+4u+2 so that E /o ( (^) ) < exp(C 6 (s, i/)c?) < l + (l-r/) 2 
for a good choice of c\ . 

8.2 Proof of Theorem H 

We follow the same proof as the one of Theorem [T] but with this time a random L. Let k^ = 
\ L (\ogN) s \. We choose kjy elements of S: s\ < ■ ■ ■ < Sk N such that Sj+i — Sj > log(N)~ 1 . 
Then it is possible to find J\ > • • • > J^ N > 1 such that for all 1 < j < Zcjv, N / ^\og{kj^) = 
c 2 2 J j < y 2u+2s 3 +1 ). We also define j 2 = ci2~ 2J ^ u+s 3 +1 \ We consider hypothesis functions 

L-i l 
fe = /o + 2^, z_^ ^ Qumflm 

L l=L/2m=-l 

and we take a prior of the form /i = kjy~ £7=1 Mi- Then # is randomly chosen such that 
Hj(6i m = ±7i) = 1/2 if L = 2 J J,2 J J- 1 < Z < 2 j j, -Z < m < Z and /i-,(6»/ m = 0) = 1 otherwise. 
This means that L is fixed equal to 2 j with probability 1/kjy and random densities with respect 
to the measures \Xj have the following form 

L-\ l 
fe = fo+ 2^ 2_> OlmPlm 

l=L/2m=-l 

where //j(0z m = ±7j) = 1/2. 

Given the proof of Theorem 111 we easily verify that //j-a.s. fg G H%(sj, R,Cipff(sj)) if c\ 
is chosen small enough. Now, since sup se5 supr eff , s RCtb ad (s)) Pfi^-N = 0) > P m (Ajv = 0) and 
according to (11), it is sufficient to bound the x 2 -divergence. So we will show that 



'% supE m(s|) j- 1+(1 -" )2 



which comes back to 



^E E /o (- 



Em r p i^ E H^^» sl+(1 -" )2 



P,9= 
Using Fubini's Theorem and independence of the Zj's, 



i=l 
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Denoting a t (Li,L 2 ) = ^^L 1 /2<l<L 1 1 L 2 /2<l<L 2 we can write 

/ 4irpop e , = 1+ ^ ^6 L i m 6' Llm ai(Li,L 2 ) 

J T . T 1 

Thus 



L\,L 2 Im 



A <exp JV ^ 6 Llm 8 / Llm ai(L 1 ,L 2 )\ < JJ exp(N0 L i m e , Um ai(L 1 ,L 2 )). 

\ LiL2lm J L\L 2 lm 



4vr / p e p e 
Using the distribution of (9, 9'), we obtain 

\ UJr fo UJr fo/ 

1 Y[cosh(Nj plq ai(2 J »,2 J «)). 



Ei ' d dp pd dp q ) ~ Xl^MNiviM^^ + ^M-NiviM^^)) 

/o /o / lm 



I III 



Now, using cosh(2a;) = 1 + 2sinh (x), inequality 1 + a < e a and sinh(x) = x + o(x) 

dP u dPa 



E 



/o 



'dPZdPT 



< exp(2^ S inh 2 (iV 7p 7 g a i (2^,2^)/2)) < exp^iV^^ 2 ^ |^(2 J -, 2 J 



We observe that ai(2 Jp ,2 Jq ) = as soon as J p / J q . That yields 



/■m 



,'dP u dP u 

E- ( 



^/o 



dP f0 dP 



< \ 

^J < exp(C 2 7V 2 7p 4 2 2 ^l p=g ). 



Then 



A; 



3T £ % (^^) * iE ex P(^\ 422Jp ) ^ ^exp(C 2 c 2 c 2 log(M) < A^" 1 



A- 



N 



p=l 



fcjy 



which is bounded by 1 + (1 — 7/) 2 for TV large enough if we choose c\ small enough. 
8.3 Proof of Lemma [T] 



We recall the result from Gine et al. ( 2000 1 . 



Lemma 2. Let u a bounded canonical kernel, completely degenerate of the i.i.d. variables 
Zi,..., Zn ■ There exist universal constants Ki, K 2 > such that, for all x > 0, 



/ , u(Zj 1 ,Zj ; 

l<i 1 ^i 2 <N 



> x I < K\ exp — K 2 min 



X X X I X 



2/3 1/2 



C 2 ' D ' £ 2 / 3 ' A 1 / 2 



where A, B, C, D are defined by 

A=||ii||oo, B 2 = N\\E{\u\ 2 (Z 

and 



C 2 = N 2 E[\u\ 2 (Z 1 ,Z 2 )] 
D = Nsup{\E[u(Z l ,Z 2 )\u 1 (Z 1 )u 2 (Z 2 )}\,E[u 2 (Z)}<l 1 E[u 2 (Z)]<l} 
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We apply this Lemma to the kernel 



L l 



«(«.!/) =J]5] $im(x)$i m (y). 



1=1 m=—l 



which is degenerate for Z{ under Hq. As one may have noticed, we stated the lemma above 



with a kernel u taking complex values. Normally, the result of Gine et al. (2000) was stated 



for real valued kernel. But their result can be extended to complex valued kernel by simply 
separating the real and imaginary parts as shown below. Indeed if we denote ur and ui the real 
and imaginary part of u it follows that 



/ u{Zi x ,Zi. 

l<i 1 ^i 2 <N 



> X 



y~] uR(z h ,z,, 



12, 



+ 



< 



y~] uR(z h ,z. 

l<i 19 £i 2 <N 



12, 



X 



l<ilJ4 a <N 



Yl ui(Z, h ,Zi^ 

l<i 11 Li 2 <N 

2 

2 
> - 



>x A 



Hence, it amounts to a real valued problem. We only deal with the real part since exactly 
the same arguments remain true for the imaginary part. Let us show now that the bounds 



A, B, C, D of Lemma 



hold for the real part 



U R + u j = M 2 ) we have ur < \ur\ < \u\. Then 



E,i<i 1 & 2 <N u R( Z h, z i2) >x/2\ . Because 



\ur\\oo < \\u\\oo < A 



N\\E((u R ) 2 (Z,.) 



< N\\E(\u\ 2 (Z,.) 



< B 1 



N 2 E[(ur) 2 (Z 1 ,Z 2 )]<N 2 E[\u\ 2 (Z 1 ,Z 2 )] < C 2 . 
As for the last term D, since u\ and u 2 are real valued 

|E(u(Zi, Z 2 )u 1 (Z 1 )u 2 (Z 2 ))\ 2 = \E(u R (Z 1 , Z 2 )u l (Z 1 )u 2 (Z 2 ))\ 2 + \E(u I (Z 1 , Z 2 )u 1 (Z 1 )u 2 (Z 2 ))\ 2 
it entails that 

Nsnp{E( U R(Zi,Z 2 ) Ul (Z 1 )u 2 (Z 2 ))} < iVsup{|E(n(Z 1 , Z 2 ) Ul {Z x )u 2 {Z 2 ))\} < D 
which concludes the justification our lemma. 

Let us compute now the four bounds, A, B, C, D. 

> Computation of A 
Denoting by Y (x) the vector (Y^(x))_K m </ and using algebraic properties of the spherical 
harmonics, 



£ \$ lm (x)\ 2 = ^\(f:L 1 Y\x)) m \ 2 < iitfitxxo*)! 8 < do 2 i 2v 



21 + 1 

Air 



(14) 
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We deduce that for all x,y £ § 2 , 

1/2 



L / I I \ ' L 

m*>v)\ ^ E ( E i^^i 2 E i<^)i 2 < Et-V^ 1 

i=l \m=— J m=—l / 1=1 

so that A < ( 7 r- 1 d^ 2 /(2i/ + 2))(L + 1) 2 ^+ 2 . 

> Computation of C 
We state the following Lemma, which allows to control the order of the variance of the test 
statistic. 

Lemma 3. Under Assumption 1, denoting C3 = 3d^ 2 4iy+2 /(4z^ + 2) 

L h h 

E E E ^fo^hmAmizmAm 2 < c 3 l^ +2 

lx,l 2 =l mi=—h m 2 =-h 
L h h 

and E E E l%o(^ imi (^)^ 2m2 (^))| 2 <C 3 L 4 ^ 2 . 

1x^2 = 1 mi=—h m 2 =— Z 2 

Proof. Under Hq, the Zj are uniformly distributed on the sphere. Then 
E /o ($ hmi $f 2m2 (Z)) = / &i imi <S>i 2m2 (z)dz 



7 j (J e -ijmini(/ e -i)m 2 n2 / ^n\\Z)Y n2 \z)dz 
/ J \J P —i)mi,n\J r -i)m2, , nAh=l2- 



But, for any matrices A = {a mn )-l<m<l-l<n<h B = ( b 



mn)—l<m<l,—l<n<l 



E|V^„ u |2 <- |U||2 \^il |2 <- II 4||2 II R Ti|2 _ 11 4 i|2 11 R ||2 

I / 4 a min<Jm2n\ J: I lollop / 4 l"m, 2 re| — I lollop 11-° Hop - II -"-Wop II -" Hop 

mi n n 

Then _ 

E |E /0 ($, imi $ j2m2 (Z))| 2 < 11/^11^(2^1 + l)l, 1=fa (15) 

mi,m 2 

and 

E E i%o(^ mi ^^))i 2 < E 11/^11^(2^+1) < 3<i - 4 e ^ +1 < ^( L + 1 ) 4 ^ 2 - 

; 1) i 2 = lmi,m 2 / 1= 1 Zi=l 

For the second term, we can write, using (p| 

E /0 ($ /imi ^ 2m2 (Z)) = Y,^)munU%)m,M-^T^h=l 2 - 
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Then it is sufficient to apply the same method with matrix B such that b mn = (— l) n (/*_\)m-n = 
( — l) n a m _ n . The conclusion results from equality ||-B|| op = ||A|| op . □ 

Lemmaggives C 2 < 3^ 4 /( 4 ^ + 2)N 2 {L + l) 4u+2 . 

> Computation of B 
Let x £ S 2 . We can write 

E fo [\u(Z,x)\ 2 ] = Y / Yl ^h[^hm 1 ^hm 2 (Z)]^l 2 m 2 (x)^i imi (x). 
l 1; l 2 rn 1 ,rn 2 



But we have seen previously than E^ [<&j imi $; 2m2 (Z)] vanishes when l\ ^ 1%. Then, using 
Cauchy-Schwarz inequality, we compute 



E f0 [\u(Z,x)\ 2 ]<Y^[ Yl l E /o0^l^m 2 (^)]| 2 Yl l<W*)<W*)l 



\ 1/2 
2 I 



I \mi,mj mi,m 2 



Now, we use previous computations (|15j) and (14) to state 



E f0 [u 2 (z,x)} < J^ (Wf^iWipW + i)) 1/2 El^(^)l 2 

I m 

i 

V^ V 3 4 ,4^+3/2 , V 3jg , r n 4i/+5/2 

" ^ vr d ° ' -7r(4^ + 5/2)^ +ij 

Thus 5 2 < /^n- l d Q A /{Av + 5/2)N(L + l) 4 ^+ 5 /2. 

o Computation of D 
Let us first compute Ej ( ( l ) ; ?n (Zi)Mi(Zi)) under Hq. We denote by C/{ the vector of the Fourier 
coefficients of u\ with harmonic order I: U\ = (< «i,l^ >)-i<n<l- 

Ef (&i m (Zi)ui(Zi)) = / $ /m (x)m(x)dx = ^ (ff-i) mn / F^(x)ui(x)& 

n=—l 
I 
= Y,^)mn<U 1 ,Yj l >=(f^L 1 Ui) m . 
n——l 

Then 



m=— i 

But, using Parseval's equality 

EWUif = J2J2\<u l ,Yj l >\ 2 = J ul(x)dx 



l>0 l>0 n=-l 
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so that, under Hq, £^ ||C^il| — ^/o( n i(-^i))- m the same wa Y we can prove 



l 



J2 \E f0 ($ lm (Z 2 )u 2 (Z 2 ))\ 2 < d^ 2 l 2v \\U l 2 \ 



m=—l 
r2 112 ^ TO . f„,2/ 



with ]T^ 1 1 U 2 \ | < Efg^K^i))- Then, using repeatedly Cauchy-Schwarz inequality, 

L l 
E fo (u(Z l ,Z 2 )u l (Z 1 )u2{Z 2 )) = Yl E ^f ^im(Z 1 )u 1 (Z 1 ))E fo ^ lm (Z 2 )u 2 (Z 2 )) 

1=1 m=—l 

L / I I \ l l 2 

^ El E l%o(^m(^l)«l(^l))| 2 Yl \&f Q ^UZ 2 )MZ2))\ 2 \ 
1=1 \m=—l m=—l / 

L / L L \ I 

< E d o 2 ^n^iiii^ii<^ 2 ^ Eii^ii'Eii^ii 2 
i=i V/=i i=i j 

Thus D < d 2 NL 2u . 

Conclusion 

Now, using Lemma p] with x = N(N — l)t/2, we obtain 

/ ( N 2 t 2 Nt Nt 2 / 3 N^^W 

P {\T L \ >t)<K x exp \-K 3 min I -^+2 , j^, L4 , /3+5/6 , -jj^ J J 

where K% only depends on do and v. Then 

P(\T L \ >L 2 ^u N /N) < K X exp (-Jf 8 min (u 2 N ,u N L, N^ufL^, N l / 2 uf L^/ 2 )) 

< K\ exp(-K u%) 
provided that u N = O(L), L = <3(iV 2 u^ 8 ) and L = 0(Nu N 3 ). 

8.4 Proof of Theorem M 

First, using Lemma [T] 

P/o(Av = 1) < J2 P/ (\T L \ > sJ^Hl) < J2 P/o (\Tl\ > L^^K^loglogN/N) 
Lee ^ ' Lee ^ ' 

< Y. Kl exp(- J ftT (2K - 1 log log N)) = K x J^ exp(-21oglogiV) 
Lee Lee 

< K 2 \C\(log(N)y 2 = OOog(iV)- 1 ) = o(l) 

since |£| = 0(log(A r )). Then, for iV large enough, Pj (Z?jv = 1) < ^/2. 
Now let / G Hi(s,R,Cil> N ). Then 

P/(Av = 0) =P/ (VL € £, |T L | < ^IK^t 2 ) < F f (\T L *\ < yj2K^^\ 
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with L* = 2i* and j* = [log 2 [(N/y/\og]ogN) 1 ^ 2s+2u+1 'l]\. Remark that for N large enough, 
4 log log N < (iV/Vlog log iV) 1 /(2 S +2z,+i) < at/ (log log iV) 3 / 2 , so that j < j* < j m and L* 
belongs to C. Note also that with this choice t 2 L * < ip^ and L*~ 2s < 2 2s, 0jv Using triangle 
inequality we have that 

F f (\T L *\ < T^tl.) < P/ (\T L . - E f (T L .)\ > 11/ - /oil! " ^2^4!* - B f (T L *)\ (16) 

where 5/(T L ) = ||/ - /bill - %(T L ). If / is in the Sobolev ball W a (S 2 ,R), it directly follows 
from the definition of W S (S 2 , R) @ that 



B f( T L) = EE i/™i 2 ^ r2l ~ 2s ^ r 2 ^n. 

1>L rn 

We set d -= \I2Kq 1 + R 2 2 2s and C 2 = 1 - C x /C> 0. Using the definition of H x 

^N<C- x \\f -h\\\. 



Markov inequality yields the following upperbound for the expression (16) 



Var/(r L * 
C|||/-/o| 



*tiP» = Q) < rnfzidi - W 



Let us now state the following Lemma which evaluate the variance of the estimator Tl. 
Lemma 4. // Assumption 1 is verified, 
v ^ w fL^ 2 ||/- /o || 2 L 4 -+ 4 ||/-/o|| 2 L 2 " +1 ||/-/o|| 3 L 2 " +2 ||/-/ | 

v-/(t l ) < c 4 ^^ + iv* + n + n + ^v~ 

where C4 only depends on do and v. 
Proof. We have 



Var/CZi) = E((T L - E(T L ))(T L - E(T L )). 
Simple calculations entail that 



Va r/ (T L ) = -^ El^ilV* 



p*«2 |2 

'7712 I 

h,mi h,m 2 



4 

+ 



(N(N-l)) 2 
The term 



L h h / _ 

E E E E ( E SjimiC^iJSlxmiC^tt) E $ ta(4)V 2 (^ 

Jl, ^2 = 1 777l= — Zl 7772 = — h \Jl<72 «3<*4 



E E ^hmA Z h)^hm 1 {Z i2 ) Y, $l 2 m 2 (Z i3 )<!> l2m2 (Zi A ) 



\i\<i 2 73<*4 
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is bounded by 

E E E ( $ iimi(4)WWS $ l»(^)) 



ii<i2 i'i<ii 



Jl<«2 «3<*4 



iCrlCrin^^s/M + |E(*j 1 mi(^)*i 3 m a (^))riii=is,i a =i 



'4 



+|E($ /imi (Z)$ /2m2 (Z))| 2 l il=i4ii2=i3 



+ E(^ imi (^)^ 2m2 (Z))/*/^ 2 ln= i3 ,i 2 ^ 



p*Zl f*^2 



*Zl f*hi 



E(^ imi (Z)$ i2m2 (Z))/«Vm1l il ^3^=M+E(^ 1 m 1 (^)^ 2 m 2 (2))/m 1 Vm 2 2 li 1 =i 4 , i2 ^ 



'3 



+E(^ imi (Z)$ /2m2 (Z))/*/^l«^ 4 



«2=*3> 



Eventually we get that 



Var/Pi) 



E E 

il,Z 2 =l rn,\m,2 

1 
+ 

+ 

+ 



(iV-2)(iV-3) 
N(N-l) 



*h 1 2 1 f+h 1 2 

m.2 I 



i lf* (1 rlf 



(18) 



JV(iV-l) 
2(JV - 2) 
N(N-1) 
2(N-2 



\E(<S> limi (Z)<S> l2m2 (Z))\ 2 + |E($ Jimi (Z)^ 2m2 (Z))|' 



E(^ imi (Z)^ 2m2 (Z))/^/, 



m 2 



/m,2 



;V( v _^m^E($ /imi (Z)^ 2m2 (Z))/^/^ 
where £R(x) denotes the real part of x. We shall now upperbound each term that appears in the 



expression (18) above. 



> First term. Since Ej=i Em I /ml ^ 11/ ~ /oil , we obtain 
" v f (N-2)(N-3) \ , 2 , 2 

/Lv Z^ I N(N-l) i -j\Jm 1 \ \Jm 2 \ - 

l u l 2 =lm 1 m 2 v v ; 7 



11/ -/r 114 
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iV 



> Second term. Firstly 
|E($ Zimi (Z)$; 2m2 (Z))| 2 = / <S> hmi <5>l 2 m 2 fo + / ^Jimi^fa^C/z - /o) 

< 2|E /o (^ imi (Z)^ 2m2 (Z))| 2 + 2 [ 9> hmi *^(fz - fo) 

< 2\E h (^ hmi (Z)^ l2m2 (Z))\ 2 + 2||<^ imi $^ 2 ~|| 2 ||/ z - fo\\l 
We can remark that, under Assumption 1, 



Wfz ~ /o||! = £ |(/Z - /0) m | 2 < E H^6 E K/ " /0)™! 2 ^ HZ " /' 



„2 

0112 



lin 



since ||/* || op < 1 for all I. Now let us show that there exists C\ > such that 

E E H*Iimi*^lll < C,L^\ 
Zj i,=lrai,mj 
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We deduce from ( 14 ) that 

^|i^ imi (x)i 2 i$ /2m2 (x)i 2 dx < ^ir +i J\*i, m2 \ 2 <^i? +i j:\(f:x 



|2 

m2in\ 



nil 

-4 



~ 47T 1 2 

Then 

E E ii*n«u*^iil < ^ E E^ +1 ^^ +4 

(i,Ij=lmi,ma '1,^2=1 m 2 

and, using Lemma [3j 

L L 

E E iE(^ imi (z)$w^))i 2 < 2 e E i%o(^ imi (^)^ 2m2 (^))i 2 

h,l 2 =lmi,m2 l lt l 2 = lm 1 ,m 2 

+2C 1 L*»+ i \\f - f f 2 

< C 2 (L 4 ^ 2 + L 4 ^ 4 ||/-/ || 2 ). 

In the same way 

L 

E E |IE(^ imi (^)^ 2m2 (^))| 2 <C7 2 ( J L 4 ^ 2 + L 4 ^ 4 ||/-/o|| 2 ). 

l lt l 2 =l mi.mj 
Thus, the second term is bounded by a constant times L iu+2 /N 2 + L 4l,+4 \\f — fo\\ 2 /N 2 . 

> Third term. Using Cauchy Schwarz inequality we get 

L ^ 1/2 

e*h\2 
\J m\ I \J 

l l! l 2 =lmi,m2 \li,l2=l*ni,m2 

E E \n$hmAZ)$hm 2 (Z))\ 2 
, h,l 2 =l m\,m 2 

<VC~2 ||/ - fo\\ 2 2 (L 2 » +1 + L 2 » +2 \\f - /o|| 2 ). 
The third term is of order ||/ - f \\lL 2v+1 /N +\\f- f \\lL 2v+2 /N. 

\> Fourth term. We bound the fourth term in the same way as the third. 

Finally we have the bound for Vary(Tx,) 

L 4v+2 ||/-/o||j^ +4 \\f - fogL 2 ^ 1 \\f - fogL 2 ^ 2 ||/ -/o|| 4 

N 2 N 2 N N N ' 



E E f%f%K(*hniW**naW) < E E I^I'IC 



□ 
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This gives 
Vaxt(T L .) ( L* Av+2 L* Au+A L* 2u+1 L* 2u+2 



11/ -/oil 4 " 4 V^ 2 ll/-/oll 4 iV 2 ll/-/o|| 2 N\\f-fo\\ 2 iV||/-/ || N, 

Besides, as ||/ - / [|| > C2~ 2s L*~ 2s and TV > L* 2s+2iy+ Vloglog N, we get an upperbound for 
(17) in terms of L* 

/ 1 L* 2 - 2s 1 L* 1 - 5 1 

3 VC 2 log log TV + Clog log TV + CVloglogJV + ^C log log N + TV 

Since s > 1, all these terms tend to zero when iV goes to infinity. In conclusion for N large 
enough, F f (D N = 0) < rj/2. 

8.5 Proof of Theorem [4] 

This proof follows the same line as the one of Theorem [3] We first give an adaptation of Lemma [4] 
in order to control the variance of Tj,: 

/ r-4i/o+2-/3 . 11/ f II 2 r -4^o+4-2/3 

Var /( T L ) < Co (^-V" + ^ V" 

IIZ-ZofL- 2 ^ 1 -^ 2£g/ , ||/-/o|| 3 £-^ +2 -^ 2 ^/, , ||/-/o|| 4 \ U<)) 
+ TV + TV ^ N )' 

This result is obtained with standard integrals evaluation which give for any real a, 

£ l a e lf3/S < C r i x a e x "/ 8 dx < C'L a+1 -Pe L "/ 5 (20) 

1=1 -* 1 

(for L large enough if a < 0). Now, we evaluate the first type error. Using that Ej (Tj y *) = 0, 
we write 

F fo (D N = l) = F fo {\T L *\>K t 2 L ,)<K 2 t- L ?V a v f0 (T L *) 

< K^ 2 C a t- L tL*- Auo+2 ~ p exp (4L*P/5)N- 2 < Kq 2 C o L*- < rj/2 

for N large enough. To bound the error of the second kind, let / G Hi(s,R,Cip]\r). We have 

P/ODjv = 0) < P/ (\T L .\ < K t 2 L *) < F f (\T L * - E f (T L *)\ > \\f - f \\ 2 - K t 2 L , - B f (T L *)) . 

The definition of L* implies that, for N large enough 

-log(iV)J <L*<^-lo g (N) 

That ensures that L*~ 2s < (5/16)- 2s ^tp N and t\* < (Slog jv/8)(- 2 ^+i)//3 N ~3/4 < ^ f or N 
large enough. We set C\ = Kq + R 2 (6/16)~ 2s ^ and C2 = 1 — Ci/C (which is positive if C large 
enough). Markov inequality yields 

r ' (fl » = 0)s csfTF' (21) 

C 2 ||J - /oil 
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Using (19), we bound 



11/ -/oil 4 - °W 2 ll/-/oll 4 ^V 2 ll/-/o|| 2 

t*-2is + 1-I3/2 , r*-2i/ +2-/3 a i \ 

iV||/-/o|| 2 ^V||/-/o|| Nj- 

Besides, as ||/ — /o||| — C^L*~ 2s , we get the following upperbound 

( (log AT)(-^0+2-/3+4s)//3 Ar l/2 ( log N y-iv Q +i-2p+2s)/f3 N l/2 

Ci { N^ ' + N 2 ^ 

(l og Ar)(-2^+i-/3/2+2 S )//3 Ar i/4 (logiV)^ 2l/ '»+ 2 -^ +s )/' 9 iV 1 /4 i\ 
+ iV + iV + N ) ' 

and all these terms tend to zero when N goes to infinity. 

8.6 Proof of Theorem [5] 

The proof is analogous to the proof of Theorem [T] with hypothesis functions 

L 



m=—L 

where 
and 



f S = f0+ Yl °LmVL m , ¥(9 Lm = ± 7 ) = 1/2, 

m=—L 

7 2 = c x exp{-2L l3 /5)L- 2s+2uo - 1 
(2<51og(iV)) 1//3 



L 

This choice of L ensures that, for N large enough, 

(<Hog(A0) 1//3 < L < (25log(N)) 1 ^. 
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